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IMPROVED EXPRESSION OF POLYPEPTIDES 

TECHNICAL FIELD OF INVENTION 

This invention relates to processes and 
5 intermediates for improving the level of production of 
a desired polypeptide in a recombinant host. More 
particularly, this invention relates to an "island of 
expression" — a segment of DNA which contains a DNA 
sequence encoding a heterologous polypeptide — and the 

10 use of the island of expression to transfect a host. 
Hosts harboring this island of expression produce a 
surprisingly high level of the desired heterologous 
polypeptide • Incorporation of the island of expression 
into a host permits the desired heterologous 

15 polypeptide to be expressed substantially independent 
of its position of integration in the host genome and 
substantially dependent on the number of copies of the 
island of expression which integrate into the host 
genome • 

20 BACKGROUND ART 

It is well known that polypeptides can be 
expressed and secreted by hosts transformed or 
transfected with a DNA sequence coding for that • 
polypeptide. For example, Gilbert et al.. United 
25 states Patent 4,565,785 (1986) and L. Villa-Komarof f 
et al., "A Bacterial Clone Synthesizing Proinsulin", 
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Proc. Natl. Acad, Sci, USA . 75, pp. 3727-31 (1978) have 
shown that a selected polypeptide can be synthesized 
within a bacterial host and excreted through the host 
membrane. A similar process can be carried out in 
5 animal cells. J. Doehmer et al.^ "Introduction Of Rat 
Growth Hormone Gene Into Mouse Fibroblasts Via A 
Retroviral DNA Vector: Expression And Regulation", 
Proc. Natl. Acad. Sci. USA , 79, pp. 2268-72 (1982). 
Recombinant proteins have even been expressed in 

10 mammals through transgenic incorporation of an 

expression system into the pronucleus of a fertilized 
embryo. D. Bucchini et al., "Pancreatic Expression Of 
Human Insulin Gene In Transgenic Mice", Proc. Natl, 
Acad. Sci. USA , 83, pp. 2511-15 (1986); K. Gordon 

15 et al., "Production Of Human Tissue Plasminogen 

Activator In Transgenic Mouse Milk", Bio /Techno loav . 
5(11)/ pp. 1183-87 (1987). 

However, to date, none of these techniques 
has been consistently successful in permitting large 

20 amounts of a desired heterologous polypeptide to be 
expressed by a host which has integrated into its 
genome a heterologous polypeptide encoding sequence. 
This is particularly surprising in view of the high 
level of native protein production occasioned from the 

25 very same expression control sequences in their native 
environments. For example, milk specific expression 
control sec[uences permit large amounts of native 
proteins, e.g., casein, to be produced in and secreted 
from mammary glands. The very same milk specific 

30 expression control sequences, however, have not been 
demonstrated to induce large amounts of heterologous 
polypeptides when operatively linked to heterologous 
polypeptide encoding sequences. See, for example, C.W. 
Pittius et al., "A Milk Protein Gene Promoter Directs 

35 The Expression Of Human Tissue Plasminogen Activator 
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cDNA To The Mammary Gland In Transgenic Mice", Proc. 
Natl, Acad, Sci, USA . 85, pp. 5874-78 (1988). The 
level of expression in these latter constructions is 
also independent of the niomber of copies of the 
5 heterologous polypeptide encoding sequence integrated 
into the host genome. Furthermore, the level of 
expression is subject to positional effects, i.e., it 
is dependent on where the heterologous polypeptide 
encoding sequence is integrated into the genome. K.F. 
10 Lee et al. , "Tissue-Specific Expression Of The Rat 
Beta-Casein Gene In Transgenic Mice", Nucleic Acids 
Res. , 16(3), pp. 1027-41 (1988). 

Accordingly, the need exists for a method of 
increasing the expression of DNA sequence encoding a 
~ 15 heterologous protein or polypeptide independent of its 
site of integration in the host genome. Moreover, such 
methods should provide expression that is dependent 
upon the number of copies integrated into the host 
genome so that expression levels may be controlled. 

20 DISCLOSURE OF THE INVENTION 

The present invention solves these problems 
by providing an "island of expression" containing a DNA 
sequence which codes for a desired heterologous 
polypeptide. The island of expression of this 

25 invention provides for the first time, high level, 
position-independent and copy number-dependent 
expression of a DNA sequence coding for a heterologous 
polypeptide. 

As is depicted in Figure 1, the island of 

30 expression of this invention comprises, in the 5' to 3 • 
direction, a 5* flanking region, a heterologous 
polypeptide encoding sequence (coding for the desired 
heterologous protein or polypeptide) and a 3 • flanking 
region. The 5' flanking region comprises, in the 5* 
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and 3 
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wo 91/13151 



PCT/US91/01222 



- 5 - 

the 5« end of the heterologous polypeptide coding 
sequence* The signal sequence coding region is also 
operatively linked to the expression control secjuences 
so as to permit a host whose genome carries this 
preferred island of expression to produce, secrete, and 
preferably process, the desired protein or polypeptide 
from the pre-protein or pre-polypeptide coded for by 
the combined signal-heterologous polypeptide coding 
sequence. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts a schematic representation 
of a typical "island of expression" (A) and a preferred 
••island of expression'* (B) in accordance with this 
invention. 

Figure 2 depicts the construction of a 
plasmid (CAS1288) containing the 5« and 3« flanking 
regions of bovine alpha S-1 casein. 

Figure 3 depicts the introduction of the 
urokinase structural gene into CAS1288 to yield 
CAS1295, the island of expression. 

DETAILED DESCRIPTION OF THE INVENTION 

In order that the invention herein described 
may be more fully understood, the following detailed 
description is set forth. 
25 In this description the following terms are 

employed: 

Expression control sequences — DNA sequences 
that control and regulate expression of gene products 
at both the transcriptional and trans 1 at ional level 
30 when operatively linked to a structural gene (DMA 

coding for a polypeptide) . Tliey include the promoter 
and enhancer regions, ribosome binding sites. 



5 



15 



20 
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polyadenylation signals and other sequences useful in 
the expression of genes. 

Operatively linked — the linking of 5» and 
3 • expression control sequences to a heterologous 
5 polypeptide encoding secjuence so as to permit the 
expression control sequences to control and regulate 
the expression and production of the heterologous 
polypeptide. 

Heterologous polypeptide encoding sequence — 

10 a DNA sequence coding for a desired polypeptide or 
protein that is inserted into the genome of a host. 
This DNA sequence codes for a polypeptide which is 
heterologous to either the host, the flanking sequences 
or both. The heterologous polypeptide encoding 

15 sequence optionally contains its own translational 
start signal at its 5» end and its own translational 
stop codon at its 3 • end. The heterologous polypeptide 
encoding sequence may also contain its own signal 
sequence coding region. 

20 Signal sequence coding region — a DNA 

sequence which encodes a sequence of typically 
hydrophobic amino acids called a signal peptide. The 
signal peptide allows a polypeptide to which it is 
attached to cross a biological membrane. 

25 Island of expression ~ a DNA construct 

comprising in the 5' to 3* direction, a 5' flanking 
region, a heterologous polypeptide encoding sequence 
and a 3" flanking region. The 5' and 3» flanking 
regions are of sufficient size and structure to render 

30 the level of production of the desired protein or 

polypeptide substantially dependent on the copy number 
of the island of expression construct incorporated into 
the host genome and substantially independent of the 
position of integration of the island of expression in 

35 the host genome. 
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5* flanking region — is that part of the 
island of expression which is 5* to the heterologous 
polypeptide encoding secfuence. It includes, in the 5* 
to 3' direction, 5* expression control sequences and a 
5 5' untranslated region, the expression control 

sequences being operatively linked to the heterologous 
polypeptide encoding secjuence. The 5* untranslated 
region typically extends from a transcription 
initiation site to the translational start site of the 

10 heterologous polypeptide encoding sequence. 

3* flanking region — is that part of the 
island of expression which is 3 • to the heterologous 
polypeptide encoding sequence. It includes, in the 5' 
to 3* direction, a 3* untranslated region, and 3» 

15 expression control sequences. The 3* flanking region 
may also include all or a portion of the coding 
sequence from the structural gene originally associated 
with the 3* flanking region. 

DETAILED DESCRIPTION OF THE INVENTION 

20 Although not wishing to be bound by theory, 

we believe that, the island of expression allows as yet 
undefined factors within the 5* and 3' flanking regions 
to operate on the expression control sequences and to 
permit the heterologous polypeptide encoding sequence 

25 to be expressed at higher yields. Expression is also 
dependent on the number of copies of the island of 
expression construct incorporated into the host genome, 
thus allowing the level of polypeptide production to be 
modulated* 

30 The large 5* and 3» flanking regions of the 

islands of expression of this invention may also 
provide a buffer zone so that the expression control 
sequences are isolated from host expression controls 
which may be exerted by the surrounding DMA into which 



wo 91/13151 



PCr/US91/01222 



the island of expression has integrated. Therefore, no 
matter where in the host genome the island of 
expression integrates, the heterologous polypeptide 
encoding sequence will be expressed at a high level, 
5 It carries its own genomic environment along with it, 
as an "island of expression" • 

Although not wishing to be bound by theory, 
we believe that the majority of regions of DNA which 
may enhance expression from expression control 

10 sequences are found in the 5' and 3' flanking sequences 
of a given structural gene. Therefore, after isolation 
of a structtiral gene with its 5 • and 3 • flanking 
regions, the structural gene, in accordance with one 
embodiment of this invention, may be excised in whole 

15 or in part and replaced with any heterologous 
polypeptide encoding sequence so as to permit 
expression at a level consistent with that of the 
original structural gene. Alternatively, the 
heterologous polypeptide encoding sequence may be 

20 inserted at the 5* end of the structural gene without 
concomitant removal of that gene. In that embodiment, 
the heterologous polypeptide encoding sequence will 
also be expressed at a level that is comparable to the 
expression level of the original stmictural gene. 

25 Among the expression control sequences useful 

in the various embodiments of this invention are those 
which direct expression at high levels in particular 
types of cells or at particular stages of cell growth 
or differentiation, or under specific culture 

3 0 conditions. Tissue-specific expression control 

sequences are preferred in the transgenic hosts of this 
invention. 

If mammalian host cells are utilized, useful 
expression control sequences may be derived from native 
35 sequences encoding a highly expressed product from the 
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host cell itself, or they may be derived from other 
eukaryotic genes with high levels of expression, such 
as ^-actin, collagen, myosin, albumin, metallothionein 
and human growth hormone* 
5 A preferred embodiment of this invention 

provides for the production of proteins in transgenic 
mammals. This embodiment preferably uses expression 
control sequences which control and direct expression 
of gene products in mammary tissue, such as expression 

10 control sequences corresponding to casein promoters and 
the beta lactoglobulin promoter. The casein promoters 
may, for example, be selected from an alpha casein 
promoter, a beta casein promoter or a kappa casein 
promoter. More preferably, the casein promoter and 

15 associated expression control sequences are of bovine 
origin and most preferably are an alpha S-1 casein 
promoter and associated expression control sequences. 

Expression control sequences may even be 
derived directly from the cells which are to be used as 

20 the host for the island of expression construct. A 
promoter and associated expression control sequences 
having the desired level of activity in the host must 
first be identified. The island of expression must be 
designed so that each island of expression construct 

25 which integrates into the host genome is expressed in a 
copy number-dependent, posit ion- independent manner. 

We describe here a means of identifying 
expression control sequences, cloning the required 
flanking regions containing these sequences, adding the 

30 heterologous polypeptide encoding sequence, and testing 
whether the resultant construct is an "island of 
expression" in accordance with this invention. 

The first step is to determine a host and 
conditions which allow a gene homologous to that host 

35 to be expressed at a desired level or at specific 
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times. 



In the case o£ tissue culture, CHO cells 
5ro„xn, on the collagen beads found in the 
systeB are preferably used. 

s for a bo.or:,o::':::.*L\nr"'°'' ""'^-^ "^-^-^ 

in host cells undHe^^^ed co^fr"" ^'^^^^ 
expressed species Lst t IIT^.^^^":; " "y 
achieved by preparing a cOK. lib:::;^^ pl^;: ^ " 
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^.tL-rr:—: ^^^^^^^^^ - - 
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*. • °""aant: RNAs. These genomic sequences 

cos-nlds CT. Maniatis et alj JHZ^, 

SCS^^^tSfrfTT^' spring H-;;;;^" 

restrict^n sites \^ '° 

ion sites, the aaount of flanking sequences ir, 

as SLeT" reg^ons'co^tained 

Alternatively, but less preferably the 
egression control seguence .ay be isolated ;y 
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nol^ r!^! achieved by analyzing the total 

PC y^^de'T" either SOS 

s-^^r^r::: oTth?it::si s" t"-^ - - 

9 i or the largest spot m a two-dimensional 
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gel. Once identified, the band or spot is excised from 
the gel, eluted, and sxibjected to automated protein 
sequencing. Oligonucleotides based upon the amino acid 
sequence obtained from the protein sequencing are then 
5 synthesized. These oligonucleotides can then be 
labeled and used as probes to identify their 
corresponding genomic sequences from a cosmid library 
constructed from host cell DNA. 

Once a sufficiently detailed restriction map 

10 of this abundantly expressed gene has been determined, 
the coding sequences and intervening sequences of the 
structural gene may be removed from the cosmids, for 
example, with appropriate restriction enzymes and 
replaced with the heterologous polypeptide encoding 

15 sequence. Alternatively, the heterologous polypeptide 
encoding sequence may be inserted 5* to the structural 
gene. In this embodiment, the structural gene need not 
be excised. According to a preferred embodiment, the 
heterologous polypeptide is urokinase, the DNA sequence 

20 of which has been isolated and cloned from a genomic 

library using published sequences as probes. A. Riccio 
et al., "The Human Urokinase-Plasminogen Activator Gene 
And Its Promoter", Nucleic Acid Res.. 13(8), 
pp. 2759-71 (1985). 

25 The resulting construct has the DNA sequence 

coding for the heterologous polypeptide flanked on both 
sides by the genomic sequences of the abundantly 
expressed gene which was originally isolated from the 
host cells. Constructs containing the various lengths 

30 of 5« and 3» flanking sequences must be tested to 

determine what size flanking regions are necessary to 
direct expression of the heterologous polypeptide 
encoding sequence in a copy number-dependent, position- 
independent manner. 
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To determine that the isolated cosmid 
contains sufficient 5* and 3' flanking regions to 
permit an inserted heterologous polypeptide encoding 
sequence to be expressed at substantially the same 
5 level as that of the highly expressed homologous DNA 
sequence^ the selected DNA sequence is transfected into 
cells in tissue culture or introduced into the genome 
of an embryos to produce transgenic animals. 
Preferably^ the cells or embryo that will be used for 

10 ultimate production are employed in this step. The 

transformed hosts are then tested for the expression of 
the heterologous protein by any of a number of well- 
known assays. These include, but are not limited to, 
radioimmunoassay, ELISA, immunoblotting and assays 

15 which measure the activity of the desired polypeptide. 
Alternatively and preferably, mRNA levels under a 
variety of growth conditions are used. This may be 
achieved by the Northern blot technique using the 
previously described oligonucleotides (corresponding to 

20 the polypeptide sequences) or the cDNAs identified 
previously as probes. 

Because the expression control sequences 
selected from the host cells demonstrate the ability to 
direct expression of the homologous gene at a high 

25 level under known conditions (e.g., CHO cells growing 
on collagen beads in the VERAX~ system) , it is expected 
that substantially the same level of expression of the 
heterologous polypeptide would be seen under those same 
conditions. Should the cosmid derived DNA sequence not 

30 provide such level of expression, then other cosmids 
containing different lengths of 5« and 3» flanking 
regions should be analyzed in substantially the same 
way until an appropriate DNA sequence is located. 

The levels of production of the heterologous 

35 protein adduced by this sequence are then compared to 
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the copy nximbers of the integrated island of 
expression. Copy number is determined by appropriate 
restriction enzyme analysis* The expression constructs 
which show position-independent^ copy number-dependent 
5 expression are the optimal "islands of expression" in 
accordance with this invention* 

According to a preferred embodiment the 
desired polypeptide is secreted by a host harboring an 
island of expression of this invention* Secretion of 
10 polypeptides is accomplished by fusing a DNA sequence 
coding for a signal peptide to, and in reading frame_ 
with^ the DNA encoding the heterologous polypeptide. 
The size of the signal peptide is not critical for this 
invention. All that is required is that the signal 
15 peptide be of a sufficient size and sequence to effect 
secretion of the heterologous polypeptide. The signal 
sequence encoding the signal peptide may be exemplified 
by signal sequences associated in nature with the 
expression control sequences, signal sequences 
20 associated in nature with the desired heterologous 
protein or polypeptide, signal sequences which are 
native to the host, signal sequences which are native 
to the source of the heterologous polypeptide, signal 
sequences which are native to the source of the 
25 expression control sequences and any other sequences 
encoding functional signal peptides. 

Many of the proteins to be expressed are 
normally secreted and will have their own signal 
peptide which should be adec[uate to direct secretion* 
30 In this case, the DNA encoding that signal may be 
included in the heterologous polypeptide encoding 
sequence that is inserted into the island of 
expression. To produce a polypeptide that is not 
normally secreted, it is possible to use a signal 
35 sequence from polypeptides which are normally secreted 
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from the host cells or from other secreted 
polypeptides. A preferred embodiment of this invention 
uses sequences encoding milk-specific signal peptides 
or other signal peptides useful in the maturation and 
5 secretxon of protein in mammary tissue. These include 
the sxgnal sequence from alpha s-i casein. if the 
heterologous polypeptide to be expressed is associated 
xn nature with its own signal sequence, the signal 
sequence associated in nature with the heterologous 
10 polypeptide coding sequence is the more preferred 
signal sequence. 

The necessary 5 • and 3 • flanking regions are 
Characterized by the ability to cause expression from 
the xsland of expression construct to be position- 
15 xndependent and copy number-dependent. The length of 
proofrt"^'"^ sequences is not critical as long as these 
propertxes are conferred to the expression construct. 
The upper size limit is defined by the ease of 
nianipulating the DNA. m the original source of the 
expression control sequences (in the animal or in the 
cell ixne) , the expression control sequences are 
flanked, in theory, by the whole chromosome. Present 
techniques allow the ready manipulation of 40-50 kb 
segments of DNA. This requires the use of well-known 
25 cosmxd technology. There may also be a limit on thl 

used in embryo manipulations. The preferred technique 
xs to use as large 5- and 3- flanking regions as 
possible to insure enough insulating region to confer 
30 copy number dependence and position independence. 

The coding sequence of the desired 
heterologous polypeptide can be derived from either 
CDNA, genomic sequences, synthetic DNA or semisynthetic 
DNA. Among the polypeptide products which may be 
produced by the processes of this invention are for 



35 
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example^ coagulation factors VIII and IX, human or 
animal serum albumin, tissue plasminogen activator 
(tPA) , urokinase, alpha-1 antitrypsin, animal growth 
hormones, Mullerian Inhibiting Substance (MIS) , cell 
5 surface proteins, insulin, interferons, inter leukins, 
milk lipases, antiviral proteins, peptide hormones, 
immunoglobulins, lipocortins and other heterologous 
protein products. 

The desired heterologous polypeptide may be 

10 produced as a fusion protein containing amino acids in 
addition to those of the desired or native protein. 
For example, the desired heterologous polypeptide of 
this invention may be produced as part of a larger 
heterologous protein or polypeptide in order to 

15 stabilize the desired protein or to make its purifi- 
cation easier and/or faster. This may be achieved by 
inserting the heterologous polypeptide encoding 
sequence into the island of expression at a position 5" 
to, and in reading frame with, the structural gene, or 

20 portion thereof, which was originally associated with 
the expression control sequences. It will be obvious 
that such a construct requires removal of the 
heterologous polypeptide termination codons prior to 
insertion into the island of expression. 

25 Alternatively, the fusion protein coding 

region may be constructed prior to insertion into the 
island of expression. The fusion protein construct may 
comprise 2 or more heterologous polypeptide encoding 
sequences or portions therof , as long as the seqeunces 

30 are in the same reading frame. Such constructs may be 
made using techniques known in the art. The fusion 
protein may then be cleaved, if desired, and the 
desired protein isolated. The desired heterologous 
polypeptide may be produced as a fragment or derivative 

35 of the polypeptide that was originally associated with 
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practice ^ViTitZ^.r^^lt' ''^""^^^ ^ ^ 
In the art to be caoabl. ^ ''•ther, any piasmid known 
tor, and carry ing "a' e J ' "P"«ted, selected 

- suitable vehic^e\„":b^h":o":n:L:''iTr " ' 

expression of this invention tZl °^ 
Inlands Of e=n,ression of th^s inv^ 
between convenient restrictiL °" 
that they can be easily i3o\T.d'^ ^^'^'^ ^° 

25 plasmid sequences f • remaining 
host. ""^ "corporation into the desired 

island Of ZrTiTnt.:LZT°T'^ 
number of factors reco^'ad ^n ' 
30 Include, for examnlo ^^"^ " These 
vector, toxLityTf ' """""""""^ 

recovery of d! ! ""^^P-PWde products, ease of 
requirements of the hete^!,' Processing 
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a particular desired protein or polypeptide from any of 
these factors alone* Instead, a balance of these 
factors must be struck with the realization that not 
all hosts may be equally effective for expression of a 
5 particular heterologous polypeptide. 

Useful mammalian host cells may include B and 
T lymphocytes^ leukocytes, fibroblasts, hepatocytes, 
pancreatic cells and undifferentiated cells. 
Preferably, immortalized mammalian cell lines would be 

10 utilized. For example, useful mammalian cell lines 

would include 3T3, 3T6, STO, CHO, Ltk", FT02B, Hep2B^ 
AR42J AND MPCIL. Most preferable mammalian cell lines 
are CHO, 3T3, and Ltk". 

Embryos from various mammals may be used in 

15 this invention to produce transgenic animals. The 

choice of a host embryo may depend on factors such as 
desired final destination of the heterologous 
polypeptide in the animal. For example, in a preferred 
embodiment for the expression of heterologous 

20 polypeptides in mammal's milk, preferred host embryos 
are from animals which are already bred for large 
volume milk production, e.g., cows, sheep, goats and 
pigs. 

There are standard procedures for introducing 
25 the DMA of the expression construct into animal cells. 
Commonly used transfection methods include 
electroporation [H. Potter et al., "Enhancer-Dependent 
Expression Of Human Kappa Immxinoglobulin Genes 
Introduced Into Mouse Pre-B Lymhocytes By 
30 Electroporation", Proc. Natl. Acad , sci. USA, 81(22), 
pp. 7161-65 (1984); G. Urlaub et al., "Isolation Of 
Chinese Hamster Cell Mutants Deficient In Dihydrof olate 
Reductase Activity", Proc. Nat l- Acad, Sci. USA, 77(7), 
pp. 4216-4200 (1980)], protoplast fusion [R.M. Sandri- 
35 Goldin et al., f^nl , g^ll. Biol., 1, pp. 743-52 (1981)], 
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calcitua phosphate coprecipitation [F.L, Graham and A.J. 
van der Eb, "A New Technique For The Assay Of 
Infect ivity Of Human Adenovirus 5 DNA", Virology # 
52(2), pp. 456-67 (1973); A.D. Miller et al,, "c-fos 
5 Protein Can Induce Cellular Transf oiniiation: A Novel 
Mechanism Of Activation Of A Cellular Oncogene", Cell , 
36(1), pp. 51-60 (1981)] and DEAE-dextran sulfate 
mediated protocols. In addition, many variations of 
the DEAE-dextran sulfate and calcium phosphate methods 

10 exist [C. Queen and D. Baltimore, "Immunoglobulin Gene 
Transcription Is Activated By Downstream Sequence 
Elements", Cell . 33(3), pp. 741-48 (1983); CM. Gorman 
et al., "Recombinant Genomes Which Express 
Chloramphenicol Acetyltransf erase In Mammalian Cells", 

15 Mol. Cell, Biol. . 2(9), pp. 1044-11 (1982); R.S. Mclvor 
et al., "Expression Of A cDNA Sequence Encoding Human 
Purine Nucleoside Phosphorylase In Rodent And Human 
Cells", Mol. Cell. Biol. . 5(6), pp. 1349-57 (1985)] 
which may offer certain advantages. For example, 

2 0 calcium phosphate coprecipitation procedures are 

particularly effective with mammalian cells, including 
CHO cells. 

A selectable marker is usually cointroduced 
with the island of expression construct into mammalian 

25 cells as a separate piece of DNA so that those cells 
which incorporate the expression construct can be 
readily isolated. Useful selectable markers include 
dihydrof olate reductase, metallothionein, neo, got , and 
hisD among others. The selected cells are then tested 

30 for expression of the heterologous protein. 

There are also standard techniques for 
introducing the expression construct into the genome of 
a mammalian embryo. One technique for transgenically 
altering a mammal is to microinject the island of 

35 expression construct into the pronucleus of the 
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fertilized mammalian eggs to cause one or more copies 
of the construct to be integrated into the genome and 
retained in the cells of the developing mammals. 
Briefly, microinjection involves isolating fertilized 
5 ova, visualizing the pronucleus and then injecting the 
DNA into the pronucleus by holding the ova with a blunt 
holding pipette (approximately 50 tim in diameter) and 
using a sharply pointed pipet (approximately 1.5 /im in 
diameter) to inject buffer containing DNA into the 

10 pronucleus. See, for example, D. Kraemer et al., "Gene 
Transfer Into Pronuclei Of Cattle And Sheep Zygotes", 
Genetic Manipulation of the Earlv Mammalian Embrvo . 
pp. 221-27, Cold Spring Harbor Laboratory (1985); R.E. 
Hammer et al., "Production Of Transgenic Rabbits,. Sheep 

15 And Pigs By Microinjection", Nature, 315, pp. 680-83 
(1985) ; and J.W. Gordon and F.H. Ruddle, "Gene Transfer 
Into Mouse Embryos: Production Of Transgenic Mice By 
Pronuclear Injection", Methods in Embrvoloav. 101, 
pp. 411-33 (1983). 

20 Microinjection is preferably carried out on 

an embryo at the one-cell stage, to maximize both the 
chances that the injected DNA will be incorporated into 
all cells of the animal and that the DNA will also be 
incorporated into the germ cells so that the animal's 

25 offspring will be transgenic as well. Usually, at 

least 40% of the mammals developing from the injected 
eggs contain at least one copy of the cloned construct 
in somatic tissues and these "transgenic mammals" 
usually transmit the gene through the germ line to the 

30 next generation. DNA isolated from the tissue of the 
resulting transgenic mammal may be tested for the 
presence of the island of expression by Southern blot 
analysis. If one or more copies of the island of 
expression remains stably integrated into the genome of 
35 such transgenic mammals, it is possible to establish 
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penaanen^ teansgenic masoaal lines carrying the island 
of expression construct. 

The offspring of transgenically altered 
mammals may be assayed after birth for the 
5 incorporation of the island of expression construct 
into the genome. Preferably^ this assay is 
accomplished by Southern hybridization of chromosomal 
material from the progeny using a probe corresponding 
to a portion of the heterologous polypeptide coding 

10 sequence. Those mammalian progeny found to contain at 
least one copy of the construct in their genome are . 
grown to maturity. In a preferred embodiment of this 
invention, the female species of these progeny will 
produce the desired heterologous polypeptide in or 

15 along with their milk. Alternatively, the transgenic 
mammals may be bred to produce other transgenic progeny 
useful in producing the desired heterologous 
polypeptides. 

EXAMPLES 

20 EXAMPLE 1 • CONSTRUCTION OF THE BOVINE ALPHA 

S-1 CASEIN ISLAND OF EXPRESSION 

One example of this technology is to utilize 
the island of expression construct to produce a 
heterologous protein in a specific tissue or organ 

25 system of an intact animal. In this case we directed 
high level expression of a heterologous protein in the 
mammary gland of a mammal. 

The gene construct described here contains an 
"island of expression" in which large 5* and 3" 

30 flanking regions of genomic sequence from the bovine 
alpha casein gene direct expression of the genomic 
clone of hximan urokinase. The 5' flanking region 
consists of 21 kb of upstream alpha casein sequences, 
including the first non^coding exon and the non-coding 
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portion of the second exon. The 9 kb 3 ' flanking 
region consists of the exons encoding the COOH- terminal 
half of alpha casein, the polyadenylation signal, and 2 
kb of further downs treaun flanking sequences. 
5 We cloned the bovine alpha S*l casein gene 

(CAS) from a cosmid library of calf thymus DNA in the 
cosmid vector HC79 (from Boehringer Mannheim) as 
described by B. Hohn and J. Collins, "A Small Cosmid 
For Efficient Cloning Of Large DNA Fragments", Gene , 
10 11(3-4), pp. 291-98 (1980). The thymus was obtained 
from a slaughterhouse and the DNA isolated by standard 
techniques well known in the art (T. Maniatis et al.. 
Molecular Cloning; A Laboratory Manual at page 271, 
Cold Spring Harbor Laboratory 1982)). We constructed 
15 the cosmid library using standard techniques (F. 

Grosveld et al., "Isolation Of Beta - Globin - Related 
Genes From A Human Cosmid Library", Gene , 13(3), 
pp. 227-31 (1981)). We partially digested the calf 
thymus DNA with Sau 3A (New England Bio Labs) and ran it 
20 on a NaCl gradient (IM to 5M) to enrich for 3 0 to 4 0 kb 
fragments. The partially digested DNA fragments were 
then ligated into the BamHI digested HC79 cosmid 
vector, followed by In vitro packaging by lambda 
extracts (Amersham Corporation, Arlington Heights, XL) 
25 according to the manuf actxirer • s instructions. The in 
vitro packaged material was then used to transfect the 
E>coli K-12 strain HBlOl* Clones incorporating this 
vector were selected by growth on LB plates containing 
50 ^ig/ml of Ampicillin (Sigma Chemical Co., St. Louis, 
30 HO) . 

We screened the resulting library using a 45 
base pair oligonucleotide probe, CAS-1. This CAS-1 
sequence , 5 • -ATGGCTTGATCTTCAGTTGATTCACTCCCAATATCCTTGC 
TCAG-3 * , was synthesized based upon a partial cDNA 
35 sequence of alpha S-1 casein described by Z.H. Willis 
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et al., "Construction And Identif iction By Partial 
Nucleotide Sequence Analysis Of Bovine Casein And Beta- 
Lactoglobulin cDNA Clones", DNA > 1(4), pp. 375-86 
(1982) • This sequence corresponds to amino acids 20- 
5 35 of mature bovine casein. As a result of this 

screening, we isolated three clones containing cosmids 
(C9, D4 and El) . 

The 5' and 3« flanking sequences were 
obtained from cosmid clones. El and C9. Restriction 

10 mapping and Southern blot analysis (E. Southern, 

"Detection Of Specific Sequences Among DNA Fragments 
Separated By Gel Electrophoresis", J. Mol, Biol, , 98 
(3), pp. 503-517 (1975)) using oligonucleotide probes 
corresponding to known sequenced regions of the casein 

15 cDNA (A.F. Stewart et al., "Nucleotide Sequences Of 

Bovine Alpha SI- And Kappa-Casein cDNAs", Nucleic Acids 
Res. . 12(9), pp. 3895-3907 (1984); M. Nagao et al., 
"Isolation And Sequence Analysis Of Bovine Alpha Sl- 
Casein cDNA Clone", Aaric> Biol. Chem, . 48(6), 

20 pp. 1663-67 (1984)) established that cosmids D4 and El 
contained part of the casein structural gene (DNA 
sequence coding for the casein protein) and 21 kb of 
upstream or 5* flanking sequences. The C9 cosmid 
contained part of the casein structural gene and 

25 extended to 7 kb downstream of the polyadenylation 
sequence. We sequenced the cosmids El and D4 in the 
region corresponding to the transcriptional start of 
the casein structural sequence and determined that the 
sequence corresponded to that of a published sequence 

30 of the same region. (L.Y. Yu-Lee et al., "Evolution Of 
The Casein Multigene Family: Conserved Sequences In The 
5* Flanking And Exon Regions", Nucleic Acid Res. > 
14(4), pp. 1883-1902 (1986)). 

The construction of this island of expression 

35 in this invention is depicted in Figure 2. From the C9 
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cosmid we subcloned the 9 kb EamHI fragment which 
begins at a Bam HI site within the intron following 
amino acid # 98 of alpha casein and continues to 
another Bam site located 2 kb downstream of the 
5 polyadenylation signal of alpha casein. This fragment 
is labelled as "C-term" in the Figure 2. This 9 kb 
fragment was cloned into BamH I -cut pUC19 to yield 
pCAS947. The doxmstream BamH I site was converted to a 
Sai l site by partial digestion of pCAS947 with BaffiHI 

10 and subsequent ligation with a Sai l linker, CAS 10, 

having the sequence, 5 " -GATCGTCGAC-3 • . The resulting 
plasmid was termed pCAS1238. This 9 kb BamHI -Sail 
fragment was used as the 3 • flanking sequence of the 
"island". It contains the 3" untranslated region and 

15 3* expression control sequences and a portion of the 
structural gene from alpha S-1 casein. 

The next step was to design the 5« flanking 
region. The region containing the transcriptional 
start, a non-coding axon and a second exon, part of 

2 0 which was also non-coding, was subcloned. A 4 kb 

Sma l / Bam HI fragment from cosmid El was isolated and 
subcloned into BamHI /Smal -cut pUC19 to yield pCAS1176. 
The plasmid was cut with Bglll, to remove the coding 
part of the second exon, and then the Bglll site was 

25 converted to a BarnHI site by ligation to a CAS 12 

linker having the sequence, 5 ■ -GATCTTGGATCCAA-3 • . The 
resulting plasmid, pCASllSl, was then digested with 
Sma l and Bam HI to remove the 3 kb piece of cosmid El 
DNA. The fragment was isolated, ligated to the 9 kb 

30 BamH I- Sal l fragment from pCAS1238, and inserted into 
the Smal /Sail digested pUC19 to yield pCAS1276. 

The resulting construct links the 
transcriptional start site to the downstream genomic 
sequence with a unique BamHI cloning site in between, 

35 into which the heterologous polypeptide encoding 
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sequence can be inserted. Since the final constructs 
will have several other BajsiHI sites in the genomic 
sequences, the heterologous polypeptide encoding 
sequence cloning site was changed to both an Xhol site 
5 and a Not I site by the addition of a linker, CAS 30, 
having the sequence, 5 • -GATCTCGAGCGCGGCCGCGCT-3 • . The 
resulting vector, pCASl277, contains Xhal and Notl 
sites as cloning sites in between the transcriptional 
start of alpha casein and the C-tenainal genomic 
10 portion of alpha casein. 

The transcriptional start and C-term regions 
from PCAS1277 were then used to replace the 
corresponding portions of the alpha casein genomic 
sequence found in the cosmid El. Since the construct 
15 is 39 kb in length, cosmid technology was used to 

manipulate the plasmids. The original El cosmid was 
partially digested with Xmal. followed by digestion to 
completion with Sai l to remove the 3'- most portion of 
the alpha casein gene contained in that cosmid. The 
20 Sma l and Xma l enzymes have the same recognition site, 
except that Xma l leaves a 5' overhang whereas Smal 
leaves a blunt end. The 12 kb Xma l- Sal l fragment from 
PCAS1277 was then inserted into the Xmal/Sall-cut 
cosmid to replace the removed portion. 
25 The ligated products were subjected to in 

vitro packaging using an in vi^ro packaging kit 
(Amersham Corporation) and the packaged DNA was used to 
transfect E.coli DH5 cells, followed by selection on LB 
plates containing 50 fig/v^l of ampicillin (Sigma 
30 Chemical Co.). The plasmids from ampicillin-resistant 
colonies were screened using oligonucleotide probes 
specific for the 3" end of casein. We identified and 
characterized plasmids which contain 21 kb upstream of 
the transcriptional start and the 2<hol/NQtI cloning 
35 site along with the genomic 3' end of the casein gene. 
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One of these plasmids, CAS1288, was then used to 
express the heterologous DNA sequence. 

The genomic clone of human urokinase was 
isolated from a genomic library using published 
5 sequences as probes. A. Riccio et al., supra . From 
the published sequence, it can be seen that there is an 
Apa l site upstrecun of the translational start of the 
gene and also downstream of the polyA transcriptional 
signal. Oligonucleotide adapters (URO 8, having the 

10 sequence 5 ' -CGTCGACG-3 • , and URO 9 , having the sequence 
5'-GTACCGTCGACGGGCC-3 • ) were used to add Sai l sites to 
these two flanking Apa l sites- This allowed the 
genomic clone to be placed downstream of the SV4 0 early 
promoter in an animal cell expression vector so that we 

15 could test for expression prior to insertion in the 
alpha casein island of expression. The resulting 
plasmid, pUK0409, directed expression of authentic 
human urokinase in transfected tissue culture cells. 
We therefore knew that the genomic clone was 

20 functional. The next step was to put the urokinase 
genomic clone into the Xho l cloning site of CAS1288. 
These steps are depicted in Figure 3. 

The urokinase genomic clone was isolated as 
an 8 kb Sai l fragment from pUK0409. The Sail 

25 overhanging ends are capable of ligating into the 2*L2l 
cloning site found in CAS1288. There is, however, 
another Xho l site in the 21 kb upstream region of alpha 
casein. We therefore carried out partial 23l2l 
digestions, followed by ligation with the isolated Sail 

30 urokinase fragment (see Figure 3). Plasmids were 
isolated from colonies and screened for the presence 
and orientation of the urokinase DNA sequence. One of 
these plasmids, CAS1295, contained the uorkinase gene 
in the correct orientation as determined by restriction 

35 analysis. This plasmid contains in a 5*- to -3' 
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After isolating the embryo, we first digested 
away the cximulus cells surrounding the egg with 
hyaluronidase. The island of expression construct was 
then injected into the pronucleus of the embryo until 
5 it swelled 30% to 50% in size. We then implanted the 
injected embryos into the oviducts of pseudopregnant Fl 
females. DNA from the tails of the resulting live 
offspring was probed with nick translated CAS1295 DNA 
to identify those animals which carried -the island of 

10 expression contruct. Three transgenic animals were 
identified. These animals were mated and the progeny 
tested for the presence of the island of expression 
construct as described supra . 

One of the transgenic lines, which carried 

15 2-3 copies of the island of expression construct, 

passed the genetic material in a Mendelian manner. The 
females of this transgenic line, which carry the 
CAS1295 insert, all produce human urokinase in their 
milk at about 1 mg/ml, as determined by enzymatic 

2 0 assay. The urokinase is inhibited by the monoclonal 
antibody #394, specific for human urokinase (Americana 
Diagnostica, Inc., New York, NY). 

The other two transgenic lines carried 20-50 
copies of the construct but failed to pass the DNA to 

25 the next generation of mice. We believe' that the 
inability of the high copy number lines to pass the 
genes is due to the high basal level of the urokinase 
during embryogenesis. Urokinase is normally expressed 
in fetal tissue (embryonic stem cells) and may function 

30 in development. The low basal level of urokinase 

expression from the casein expression control sequences 
would not interfere with development in those embryos 
inheriting two copies of the gene. However, if 
expression is dependent upon copy number, those lines 
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which have 20-50 copies would have 20-50 fold higher 
basal level and would tiierefore express enough 
urokinase to interfere with proper development. These 
results indicate that the level of urokinase expressed 
5 is copy number dependent. 

EXAMPLE 3 - TRANS FECTION OF THE ISLAND OF 

EXPRESSION CONSTRUCT INTO ANIMAL CELLS 

The island of expression construct and the 
the selectable marker pSV2-DHFR (available from the 

10 American Type Culture Collection (ATCC 37146) ) which 
codes for the production of dihydrof olate reductase In 
mammalian cells, are cointroduced into DHFR* CHO cells 
by electroporation. This technique is chosen for its 
ability to produce host cells characterized by stably 

15 integrated foreign DNA at high copy numbers. European 
Patent Application 0 343 783 fully describes this 
technique and is incorporated herein by reference. 

Prior to electroporation, the pSV2-DHFR 
plasmid is linearized by digestion overnight at 37*>C 

20 with Aat ll . The island of expression sequences are 
isolated from the vector sequences by cutting with 
restriction enzymes as described in Example 2, followed 
by gel electrophoresis to allow separation and 
purification (Maniatis et al., supra ^ . Salmon sperm 

25 DNA (200 /ig) , previously sonicated to 300-1000 bp 

fragments, is added to a mixture containing 200 fig of 
the linearized pSV2-DHFR and 0.5 mg/ml of the island of 
expression construct. To precipitate the mixture of 
DNAs, NaCl is added to a final concentration of 0.1 M. 

30 Next, 2.5 volumes of ethanol are added and the mixture 
is incubated for ten minutes on dry ice. After a ten 
minute centrifugation at 4*C, the ethanol is aspirated 
and the DNA pellet is air-dried for 15 minutes in a 
tissue culture hood. The DNA pellet is then 

35 resuspended in 800 til of IX HeBS (20 mH Hepes/NaOH, pH 
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7.05; 137 mM NaCl; 5 mM KCl; 0.7 mM Na^O^; 6mM 
dextrose) for at least two hours prior to 
electroporat ion • 

Approximately 2 x 10^ DHFR" CHO cells 
5 (subcloned from the clone designated CHO-DUKX-Bl of 
Urlaub and Chasis, "Isolation Of Chinese Hamster Cell 
Mutants Deficient In Dihydrof olate Reductase Activity", 
Proc. Natl. Acas. Sci. UAS , 77, pp. 4216-20 (1980)) are 
used for each electroporat ion. The DHFR~ CHO cells are 
10 passaged on the day prior to electoporation and are 

approximately 50% confluent on 10 cm plates at the time 
of harvesting for electroporat ion. The DHFR" CHO cells 
are detached from the plates by trypsin treatment and 
the trypsin subsequently inactivated by the addition of 
15 8.0 ml a* medium (MEM alpha supplemented with 

ribonucleotides and deoxyribonucleotides (10 mg/L each 
of adenosine, cytidine, guanosine, uridine, 2*- 
deoxyadenosine, 2 • -deoxyguanosine and 2'-deoxy- 
thymidine; 11 mg/L of 2 ' •deoxycytidine hydrochloride) 
20 (Gibco Laboratories, Grand Island, NY) , 10% fetal 

bovine serum (Hazelton, Lenexa, KS) and 4 mM glutamine 
(M.A. Bioproducts, Walkersville, MD) ) per plate. The 
cells detached from the plates are then collected and 
centrifuged at 1000 rpm for 4 minutes. The majority of 
25 the medium is aspirated off the cell pellet and the 
cells resuspended in the remaining residual media by 
flicking the tube. 

The island of expression, pSV2-DHFR and 
salmon sperm DNA, suspended in 800 ^1 IX HeBS, are then 
30 added to the DHFR" CHO cell suspension. The resulting 
mixture is immediately transferred to an 
electroporation cuvette. The capacitor of the 
electroporat ion apparatus is set at 960 and the 
voltage set at 300V. A single pulse, lasting 
35 approximately 10 milliseconds, is delivered to the 
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contents of the cuvette at room temperature. The cells 
are then incubated for 8-10 minutes at room temperature 
and then transferred to a 15 ml tube containing 14 ml 
of a* medixim. The cells are centrifuged as above. 
5 After aspirating the medixim, the wet cell pellet is 

resuspended by flicking the tube and fresh a* medium is 
added. The suspended cells are then seeded into 
culture plates in non-selective meditim for 2 days to 
allow them to recover from electroporation and express 

10 the selective gene. Approximately 20-30% of the viable 
CHO cells are expected to incorporate the island of - 
expression/pSV2-DHFR and thus survive the selection 
process. Therefore, approximately 1 x 10^ total cells 
per 10 cm plate are seeded and cultured in a 37 ^C, 5.5% 

15 CO^ incubator. 

After a recovery period of two days, the 
cells are removed from the culture plates by trypsin 
treatment as described above, counted and seeded into 
six 10 cm plates at a density of about 1 x 10^ cells 

20 per plate, in a" medium (Sigma Chemical Co.). The 
cells containing the island of expression and pSV2- 
DHFR are selected after a 4 day incubation in the a* 
media. The selected cells are then tested for 
expression of urokinase by standard techniques, e.g, a 

25 commercially available colorometric test, Spectrozyme 
UK (Americana Diagnostica, Inc.) 

Several clones that have various levels of 
expression of urokinase are selected. DNA and RNA are 
isolated from these clones and Northern and Southern 

30 analysis is carried out to determine transcription 
level and copy number of the island of expression 
construct. This analysis reveals whether expression of 
the uorkinase message is a function of the copy niimber 
and independent of the site of integration of the 

35 integrated construct. 
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A construct according to this invention 
containing plasmid CAS1288 is exemplified by a culture 
deposited at In Vitro International, Inc. in Linthicum, 
Maryland, on February 1, 1990 and there identified as 
5 CAS1288 wherein the plasmid CAS1288 is in E> coli DH5* 
It has been assigned accession number IVI 10232. 

A second construct according to this 
invention containing plasmid CAS1295 is exemplified by 
a culture deposited at In Vitro International, Inc. in 
10 Linthicum, Maryland, on February 1, 1990 and there 

identified as CAS1295 wherein the plasmid CAS1295 is in 
E.coli DH5. It has been assigned accession number IVI 
10231. 

While we have hereinbefore presented a number 
15 of embodiments of our invention, it is apparent that 
otir basic construction may be altered to provide other 
embodiments which utilize the processes and 
compositions of this invention. Therefore, it will be 
appreciated that the scope of this invention is to be 
20 defined by the claims appended hereto, rather than the 
specific embodiments which have been presented 
hereinbefore by way of example. 
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He claim: 

1. A process for producing a high t:he level 
of a desired heterologous polypeptide in a host, the 
process comprising the steps of: 

a) integrating at least one island of 
expression into the genome of said host, wherein said 
island of expression comprises, in the 5* to 3 * 
direction, a 5' flanking region^ a heterologous 
polypeptide encoding secpience and a 3* flanking region; 
said 5» flanking region comprising 5' expression 
control sequences, operatively linked to said 
heterologous polypeptide encoding sequence and a 5* 
untranslated region; said 3' flanking region 
comprising, a 3» untranslated region, and 3» expression 
control sequences, operatively linked to said 
heterologous polypeptide encoding sequence; and the 5' 
and 3* flanking regions of said islands of expression 
being of sufficient size and structure effective to 
render the level of production of the desired 
heterologous polypeptide substantially dependent on the 
copy number of the island of expression integrated into 
the host genome and substantially independent of the 
position of integration of the island of expression in 
the host genome; and 

b) culturing said host under 
conditions which allow said desired heterologous 
polypeptide to be expressed. 

2. The process according to claim 1 wherein 
said heterologous polypeptide encoding sequence 
comprises a functional signal sequence coding region. 

3. The process according to claim 2 wherein 
the signal sequence coding region is derived from a 
milk specific protein gene. 
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4. The process according to claim 3 wherein 
the milk specific protein gene is casein. 

5o The process according to any one of 
claims 2 to 4 wherein the host is a lactating mammal 
selected from the group consisting of mice, cows, 
sheep, goats and pigs and the 5' and 3* flanking 
sequences are derived from a milk specific protein 
gene • 

6. The process according to claim 1 wherein 
the heterologous polypeptide encoding sequence is 
selected from sequences encoding a polypeptide selected 
from the group consisting of: tPA, urokinase, Mullerian 
Inhibiting Substance, interferons, coagulation factors 
VIII and IX, animal growth hormones, insulin, 

inter leukins, immunoglobulins and lipocortins. 

7. An island of expression DNA sequence 
comprising, in the 5 • to 3 • direction, a 5' flanking 
region, a heterologous polypeptide encoding sequence 
and a 3' flanking region; the 5» flanking region 
comprising 5* expression control sequences operatively 
linked to the heterologous polypeptide encoding 
sequence and a 5« untranslated region; the 3* flanking 
region comprising a 3» untranslated region, and 3' 
expression control secjuences, operatively linked to the 
heterologous polypeptide encoding region; whereinupon 
the integration of the island of expression into the 
genome of a host, the 5« and 3* flanking regions of the 
island of expression are of sufficient size and 
structxire effective to render a level of production of 
a polypeptide encoded by the heterologous polypeptide 
encoding sequence substantially dependent on the copy 
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